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Abstract In this paper nonhomogeneous deterministic and stochastic Maxwell equations 
are used to rigorously formulate the capacity of electromagnetic channels such as wave guides 
( cavities, coaxial cables etc). Both distributed, but localized, and Dirichlet boundary data 
are considered as the potential input sources. We prove the existence of a source measure, 
satisfying certain second order constraints (equivalent to power constraints), at which the 
channel capacity is attained. Further, necessary and sufficient conditions for optimality are 
presented. 
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1 Introduction 



Channel capacity for MIMO channels (multiple input multiple output) has been the subject 
of intense study in recent years. Most of the papers have been concerned with a strictly 
information theoretic analysis [13, 8]. On the other hand, channel capacity can be treated as 
an optimization problem subject to the constraint imposed by the Maxwell equations [12]. In 
this paper, a mathematical framework for MIMO capacity is provided using Electromagnetic 
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constraints of the channel. It is shown that this problem can be rewritten as an optimal 
control problem where the control is the source measure subject to moment constraints 
equivalent to transmitter power constraints. 

Rest of the paper is organized as follows: The current section ends after a brief list of 
notations. In section 2, we present the dynamic models of electro-magnetic channels. Both 
distributed and boundary sources are considered, and existence and regularity properties of 
solutions of the dynamic systems are presented. In section 3, communication problems are 
formulated. Section 4 deals with the solution of the problems proving existence of control 
measures from the admissible class at which channel capacity is attained. In section 5, nec- 
essary and sufficient conditions of optimality are presented whereby a numerical algorithm 
can be developed for capacity computation. The paper is concluded with some comments 
in section 6. 

Some notations: Let S denote an arbitrary set and T the Borel algebra of susbsets of 
the set S. We call the pair (S, JT) a measurable space. Let {/x, v) be any two regular Borel 
measures on the measurable space We let /x ^ i/ to denote the absolute continuity 

of the measure \i with respect to the measure v. The Radon-Nikodym derivative, if it exists, 
of \JL with respect to v is denoted by ^pjy = 9{x)^ where g G v). 

For any pair of Banach spaces X, y, we let C{X^ y) denote the space of bounded linear 
operators from X to Y. For any bounded open connected domain C -R" with sufficiently 
smooth boundary H^{n,R^) C L2{^,R"^),s > 0, will denote the standard Sobolev 
spaces of functions defined on and taking values from R"^ whose generalized derivatives 
up to order s belong to L2{0,,R"^. Similarly H~^{n,R"^),s > 0, will denote the Sobolev 
spaces with negative exponents. These are distributions and, under some assumptions, are 
the topological duals of H^{Q, R"^^). By Sobolev's embedding theorem, it is known that 
for s > (n/2) + k, H'{n,R^) C''{n,R^). Thus the Dirac measure S^jidx) with mass 
concentrated at lo e CI, bSu, € H~^{Q, RJ^) for any h G C(0, RJ^) and s > in/2). Note that 
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for s > 0, a continuous linear functional i on H^{D,, R^) has the representation 




Ix 



for some tp G i?'"). 

For example, for any / G L2{0,, BJ^) with Oq being constants, the function ip, given by 



multi index and \a\ = ^i'=icti,ai > and -D"/ denotes the distributional derivative of / 
of order |a|. For fractional s, the Sobolev spaces are defined by use of Fourier transform. 

2 Channel Dynamics 

In this section we present several models that describe the channel dynamics. The first 
model is assumed to satisfy homogeneous Neumann boundary condition (no leakage) with 
input source being a vector of current and charge density. The second model consists of 

homogeneous wave cqiiation describing the electric field with nonhomogeneous Dirichlet 
boundary data whereby the input or source is provided. 

2.1 Channel with Current and chcirge as input Sources 

First we consider channels with homogeneous Neumann boundary condition. In this case 
the system is governed by a system of wave equations arising from Lorenz transformation 
of Maxwell's equations. The electromagnetic waves are generated by input sources such as 
current and charge densities and are confined in a wave guide. The electrical signals in the 
wave guide are governed by Maxwell's equations. Using the vector and scalar potentials 
denoted by (a, if) and the Lorentz gauge, the Maxwell's equations are given by a system of 
wave equations: 



V' = E|a|<, 



g aaD'^f, is an element of H ^{Q., BJ^). Here a = (ai, a2, • • • 



an) stands for the 



d'^a/dt'^ - {l/ne)Aa = {l/e)i,t >0,^encR^ 
d^^/dt^ - (l/Me)A99 = (l//xe)p. 



(1) 



(2) 
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where i and p are the sources, the first denoting the current density (vector) and the second 
the charge density. These are the sources that can be controlled to produce desirable field 
distributions inside the wave guide. The field variables {£', B} are related to the potentials 
by the following equations: 



These models are useful in various fields of communication such as radar, optical fibre etc 
[?] (see references therein.). We let d B? denote an open bounded connected domain 
(representing the waveguide) having piecewise smooth boundary. 

Define H = L2{^,B?) x L2{^,R) = L2{^,R^) , denote y = {a,ip) , and define the 
formal differential operator C by Cy = {1/ fie){Aa, Atp) and let B denote the Neumann 
boundary operator and set B{a, tp) = 0. This operator is simply the outward normal deriva- 
tive of the arguments at every point on the boundary of the wave guide. Then introduce 
the operator A as follows: 



Under the given boundary condition, —A is an unboTindcd positive self-adjoint operator 
in H. Define the state space as H = D{^/^A) x H and the state as z = (y, y). This is 
the energy space. Furnished with the scalar product and the associated norm as presented 
below. 



7^ is a Hilbert space. Note that the first term represents the potential energy and the second 
the kinetic energy (magnetic field energy). Then we define the system operator A and the 
control operator B as follows 



E = -{a + Vip), B = Vxa. 



D{A) = {yeH: B{y) = and Cy e H} C H^{n, R^) x H^{n, R) 



and set Az = Cz for z e D{A). 



{x, z)u = {sf^Axi, ■\/^Azi)h + {X2, Z2)h 
II ^ hi= ( II \f^Ax\ fji -I- II X2 fii 
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Define the input or the control vector as 



u 



■ 

\ P J 



Using these notations the system of wave equations given by (1) and (2) can be written as 
an abstract differential equation on the Hilbert space H as follows 



where A is an unbounded operator with domain and range in H. In practice the input is 
localized. Let ftg G ft he a part of the domain at the input end of the wave guide and 
consider the Hilbert space U = L2{^o,R^) with the standard scalar product. We may 
assume that the controls are functions of time taking values from the Hilbert space U. 
Thus our admissible source is a proper subset Uad C L2{I, U). It can be shown that on the 
Hilbert space TL the system operator A is skew adjoint and hence iA is self adjoint. Thus it 
follows from semigroup theory ([l],Theorem 3.1.4, p71), in particular Stones theorem, that 
A generates a unitary group of operators S{t), t e R. Using this unitary group of operators 
we can write the solution (mild) of equation (3) as follows 



Jo 

Note that in the absence of external input u, the system is conservative and 

II z{t) \\n=\\ S{t)zo \\n=\\ zo \\n V t g R. 
This can be proved by simply scalar multiplying in H on either side of the equation 

z = Az, z{0) = Zq 
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z = Az + Bu, t>0, 



(3) 




(4) 



by z and integrating and noting that (.4^,C)w = 0,V ^ G D[A). We summarize the above 
results in the following theorem. 



Theorem 2.1 For every input u G ^2(1^ U) and initial state zq G H, the system (3) has a 
unique mild solution z G C{I,7i). Further the solution is given by the expression (4). This 
in turn implies that the system of wave equations (l)-(2) has a unique mild solution for 
every given initial state in the energy space and every given finite energy input. 

2.2 Channel with Dirichlet Data as Input Source 

In cgs units, the Maxwell equations for electric field E and magnetic field B are given by 

V xB={l/c)dE/dt + {A7r/c)i (5) 

V xE = -{l/c)dB/dt, (6) 
V-E = ATrp, V-B = 0, (7) 

where c denotes the velocity of light and the pair {i, p} denotes the current density vector 
and charge density respectively. Here we have used standard notations for curlcf) = V x 
and divcf) = V ■ (f). Using the first identity of equation (7), the reader can easily verify that 

V X V X = -/SE + 47r(Vp). 

Now applying the curl operator on either side of equation (6) and using equation (5) one 
can easily verify that the electric field E satisfies the following wave equation 

d'^E/df - c^AE = -An{di/dt + c^Vp). (8) 

Since here we are interested in boundary data, we assume that both the current and charge 
densities are identically zero. In order to solve such equations in any bounded domain one 
must specify the initial and boundary conditions. Hence the complete system equation is 
given by 

d'^E/dt'^ -c^AE = 0,^ en,t>0, (9) 
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E{o, = MO, E{o, = e e n, (10) 

E{t,^)\dn = uit,O,^&dn,t>0. (11) 

This is a initial boundary value problem with nonhomogeneous Dirichlet boundary condi- 
tion. In general the source u carries the information to be transmitted over the wave guide 
channel J7. 

There are two possible ways of attacking this problem. One is the semigroup approach 
([2]) and the other is based on the principle of transposition ([10]). 

Method A (Semigroup Approach): The first method is based on a well known technique 
([2],p59-63)(and the references therein) whereby one can transfer the boundary data to the 
righthand side of the original differential equation. We write equation (9)-(ll) as a system 

de/dt = Le,eiO) = eo (12) 
Be = Trei = u (13) 



where e = {E, E), Trcp = and 

L = 



/ 

c^A 



Note that this is a 6 x 6 matrix with the elements of the first and the fourth diagonal blocks 
being all zero and the second block being a 3 x 3 identity matrix and the third diagonal block 
is a 3 X 3 diagonal matrix with the elements being the Laplacian A. To avoid introducing 
new notations, we use the same symbols to define the operators A, A, Bq by 

A = C^IA\kerB and A = L\kerB and Bq = B\kerL- 

Note that the operator A is & negative self adjoint unbounded operator on H = L2{^, R^). 
The domain of the operator A is given by 

D{A) = H^{n,R^)nH^{n,R^) xL2{n,R^) cn 

where H is the energy space, 

n = D{V^A) X H, 
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considered here as the state space. Now returning to the system model, it is not difficult to 
verify that the operator A is closed and densely defined and that for any R3 X^O, 

II i2(A,^ 11=11 {XI -A)-' \\h < (1/|A|). 

Thus by Hille-Yosida theorem ([1], Theorem 2.2.8,p27) , A is the infinitesimal generator of a 
Co-group S{t),t € R, of contractions in TC. Further, it is easy to verify that the operator A is 
skew adjoint and hence by Stones theorem ([1], Theorem 3.1.4, p 71), it is the infinitesimal 
generator of a unitary group S{t),t G R, on H. Our objective is to convert the initial 
boundary value problem (12)-(13) into a Cauchy problem (initial value problem). Define 

w = H^{n, R^) X L2{n, R^) c n 

and set Wi = KerL,W2 = KerB. For any A(e R) / 0, define P = R{X,A){XI - L), 
and notice that P\y\>2 = I, the identity and that = P. Thus W admits the direct sum 
decomposition as follows, 

W = Wi® W2. 

Clearly R{X, A) G jCCH, VV2)- For the source space, let U be a linear subspacc of H'^/^(dQ, R^) 
carrying the structure of a Banach space such that -Bq '■ VVi — > U is surjective and 
3^i = {Bq)~^ G jC,{U,Wi). Now going back to our original problem (12)-(13), we can rewrite 
the first equation in the equivalent form 

de/dt = + (n - {XI - L))e (14) 

with n = {XI — A). For A G p{A), the resolvent set of A, the operator 11 has bounded 
inverse giving the resolvent R{X,A). Using the direct sum decomposition, wc can express 
the solution as the sum given by e = + e^, G Wi,e^ G W2. Substituting this in 
equation (14), and following similar steps as presented in ([2], p59-62), we arrive at the 
following abstract Cauchy problem 

C = ^C + A3fi!«,Co = C(0) = i?(A,^)eo, (15) 
K = {I-R{X,A){XI-L)), (16) 
e = nc. (17) 
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Using the unitary group introduced above, the mild solution of the system (15)-(16)- (17) 
is given by 

C{t) = S{tXo + f S{t - s)A^u{s)ds, t e I, (18) 

^0 

e{t) = UC{t),teI. (19) 

Briefly this is the first method. It is clear from the expression (18) that, for every u G 
L2{I,U) and Co e D{A), C e C{I,D{A)) and hence it follows from (19) that e G C{I,n). 
We collect these facts together in the following theorem. 

Theorem 2.2 For every eg e H and u G L2{I,U), the initial boundary value problem 
(12)-(13) has a unique mild solution e G C{I,H), and it is given by the expressions (18) 
and (19). 

Method B (Principle of Transposition): The second method, which admits much more 

general boundary data, is the method of transposition ([10], p231, p283).This method ad- 
mits L2{dn) data and, more generally, data from Sobolev spaces with negative norm like 
H~^/'^(d^l). The method consists of constructing a suitable isomorphism and then transpos- 
ing the isomorphism for the solution of nonhomogeneous Dirichlet problems like (9)-(ll). 
Consider the homogeneous Dirichlet problem 

Lij; = d'^i)/df - c^Ai/j = f,Cen,t>0, (20) 
ilj{T,O=0,^{T,O=0,^en, (21) 
Ht,O\dn = 0,iedn,t>0, (22) 

for / G L2{Q, R^) = L2{I, L2{0,, R^)). Reversing the flow of time, it follows from the results 
of the previous subsection that for every / G L2{Q,R^) this problem has a unique solution 
V' G H^{Q,R^). Now introduce the vector space \1' by 

* = |V G L2{Q, R^) : G L2{Q, R^),^P{T, •) = 0, i;{T, •) = 0, Vl/xan = o| 

and furnish it with the norm topology given by 

II V' ll*=ll ||l2(q,r3) . (23) 
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The reader can easily verify that * is a normed hnear space. Since L is a closed operator, 
it follows that is a Banach space, in fact a Hilbert space. Thus it follows from the given 
norm topology that L is an iosmetric isomorphism of * onto L2{Q, R^) = L2{I, L2{^, R^))- 
For convenience of notation we may express this fact by stating that 

LeIso{^,L2{Q,R^)). 

This is known as the adjoint isomorphism. Transposing this isomorphism, we can settle the 
question of existence of solution of our original nonhomogeneous boundary value problem 
(9)-(ll). This is stated in the following Theorem. 

Theorem 2.3 Consider the system (9)-(ll) and suppose Eq G H-^/'^{n, R^), Ei G H-^/^{n, R^) 
and u e L2{I, iJ-V2(aj^, R^)) c H'^l'^il x d^, R^). Then the system (9)-(ll) has a unique 
solution E G L2{Q,R^) = L2{I,L2{^,R^)). 

Proof Formally, scalar multiplying equation (9) by any ip E and using Greens formula 
for integration by parts, one can easily derive the following identity 

/ {E, L^)d^dt 
Jixn 

= [ iEi,i;iO))d^ - [ (Eo, i^iOm I {Q^IQ^. OdaiOdt, (24) 

JQ JQ Jlxdn 

where dip/dv denotes the partial derivative of ip in the outward direction of the unit normal 
vector V at any position on the boundary dfi and a denotes the surface (Lebesgue) measure 
on the boundary. Define the functional 

iii^) = I V(0))de - / {EoAmdi -c" [ {d^l:/du,u{t,i))da{Odt. (25) 
Jo, Jn Jixdci 

Clearly this is a linear functional. Since ip G H^{Q,R?), it follows from standard trace 

theorems for Sobolev spaces that ■i/'(0) = V'(O)O)^ ^ ^5 

element of H^/'^{n,R^), 

V'(O) G H^/^{n,R^) and dip/du G H^/^{I x dCl,R^). Thus for the given data {Eo,Ei,u} 
with the regularities as specified in the statement of the theorem, the scalar products on 
the righthand side of the identity (25) have the correct duality pairings. Hence we conclude 
that the given data determines a continuous and hence a bounded linear functional I on 

10 



the Banach space ^. Since L e Iso{^,L2{Q,E?)), this means that the composition map 
{^oL''^) is a continuous Unear functional on L2{Q,R^). Hence, by Riesz representation 
theorem, there exists an unique E e L2{Q,R^) such that 

(foL-i)(/) = (£;,/)^,(Q,^3) V /GL2(Q,i?3). (26) 

Since L is an isomorphism, this is equivalent to saying that 

^(^) = (£;,L^)^,(Q,«3) V ^G*. (27) 

This also verifies the validity of the formal identity (24) obtained by integration by parts. 
The uniqueness is a consequence of the fact that L G 7so(*, L2{Q,R^)). This completes the 
proof. • 

Remark 2.4 It is clear from the above result that our nonhomogeneous Dirichlet initial 
boundary value problem (9)-(ll) has a unique solution E G L2{Q,R^) for a very general 
set of data form the class of generalized functions 

{Eo,Ei,u} G H-^/'^{n,R^) X H-^/^{n,R^) x i?-^/2(/ x dn,R^). 

For practical applications we may limit our data from the Hilbert spaces L2{^,R^) x 
L2{^, R^) X L2{I, L2{dQ, R^))- In this case, off course, we expect our solutions to be much 
more regular or smooth. 

Note that the data to solution map {Eq, Ei,u} — > E, which we denote by G, is a contin- 
uous linear map from H-^I'^{Q.,R^) x H-^/'^{n,R^) x i?-V2(/ x dn,R^) to L2{Q,R^) = 
L2{I, L2{^, R^)) and hence there exists a constant K > such that 

\\G{Eo,Ei,u) Wl^^q^rs^ < 

-^jll -^0 lliJ-l/2(n,il3) + II El ||iJ-3/2(Q^^3)|| + II U ||i/-l/2(Jxan,il3)|- 

Since for s > 0, the embeddings L2{Q) ^ H^^{Q) are continuous, it follows from the above 
result that for {Eo,Ei,u} G L2{n,R^) x L2{n,R^) x L2{I ,L2{dVL,R^)) 

II G{Eq,Ei,u) \\l2{q,b?) < 
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-^jll ^0 ||L2(f2,i?3) + II -^1 llL2a^,fi-^)ll + II ^ \\L2{I,L2{dn,R3))j, 

where K depends on K and the embedding constants 

In the study of communication problems we will set Eq = Ei = Q and consider the boundary 
data as the input source giving E = G{u) G L2{I,H). A complete characterization of the 
input-output map is given in section 3. 

Remark 2.5 It is interesting to note that Theorem 2.2 dealing with the question of exis- 
tence and regularity properties of solutions also provides a clue to numerical technique for 
solving the basic problem (9)-(ll). Let {fi} C L2{Q,R^) be a complete orthonormal set 
(orthonormality is not essential though linear independence is). Then note that 

(ioL-'M) = (£;,/i)L2{Q,R3) = Ci,ie N. 

These arc precisely the Fourier coefficients of E with respect to the complete set {fi} as 
indicated by the righthand expression, and hence E is given hy E = J2iZi Cifi- Further it 
is clear that determined entirely by the data {Eq, Ei,u} of the problem. 

Remark 2.6 Comparing method A (Semigroup Approach) with method B (Principle of 
Transposition), it is apparent that the later admits much more general data. At least for 
linear initial-boundary value problems, semigroup theory seems to be less powerful. 

3 Formulation of Communication Problems 
3.1 Distributed Source: 

Transmit End (T): First we consider the system model described by equation (3) with 
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the distributed source or control space U = ^2(^0 5 R"^)-, that is, 

U={ue L2{n,R^) : u{0 = 0,V ^ G 0\J^o}- 

For application to communication problems we may simplify the source further by taking 
a finite number of disjoint closed subsets {(Ti}"^j^ C and consider input source to be 
composed of the sum 

n 

u = J2xi{t)ipi{0,te I, no (28) 

1=1 

where the functions tpi G L2(rio,-R^) vanishing outside crj. In other words, these functions 
have CTj as their supports and Xj G L2{I) are scalar valued functions which are the signals. 
These represent message signals radiated by the strategically located n-transmit antennas. 

Receiver End (R): Let denote the receiving end of the wave guide. Sensors are located 
on this set. Again let {l3i}^i be a family of disjoint closed subsets of the set where the 
sensors are located. These sensors are assumed to be able to measure the electric field 
distribution on these patches. These represent receiving antennas. In terms of the vector 
and scalar potential {a,ip} we have already seen that the electric field vector is given by 

E = -(d + V(^). 

Hence in terms of the state variable we have 

E=-( Z6 + d2Z4 j = rz, (29) 

where F is the matrix of differential operators easily determined by the above relation. 
Clearly, F is a bounded linear operator from Tt to L2{n,, R'^). The outputs are the integrals 
of weighted sensor response to the electric field distribution on the patches. These are given 

by 

yi(i) = ^*(^ <aii^),E{s,0>R3da{0^ds + Wi{t), t e I ,i = 1,2 ■ ■ ■ ,m, (30) 

where is the vector of weight given to the measured electric field distribution on i-th 
site and Wi represents the measurement noise of this site. The weight vector aj may be 

13 



assumed to be supported on the set /Jj. We assume that {tUij^i are mutuahy independent 
standard Brownian motions with W denoting the corresponding m vector Brownian motion. 
Throughout the paper we use {E,T,Tt,P) to denote the filtered probabihty space where 
^t,t ^ 0, is an increasing family of right continuous subsigma algebras of the sigma algebra 
T. All random processes arising in this paper will be assumed to be based on this complete 
filtered probability space. 

Now returning to our problem and using the source and the output models as described 
above, the state and the measurement dynamics turn out to be 

z = Az + Cx,z{0) = zo, (31) 
dy = gVzdt + dW (32) 

where the operators {C,Q} are given by 

n . 

The reader can easily verify that the operators C : — *■ H and (QT) : H — > EJ^ and 
they are bounded linear operators. Using the semigroup S{t),t > 0, corresponding to the 
operator A, it follows from the expression (4), that 

z{t) = Kt{x) = S{t)zo + f S{t- r)Cx{r)dr. (33) 

Define the composition map F with values 

Ft{x) = {gVKt){x). (34) 

Clearly F is a nonanticipative (causal) operator mapping L2(/, i?") to L2{I,R^)r\C{I^BJ^) 
and, being the composition of bounded linear operators, it is also a bounded linear operator. 
Thus the output equations (30) can be written as as a linear stochastic differential equation 
in EJ^, 

dy = Ft{x)dt + dW, y{0) = 0,tel. (35) 

In case of method B, the map F is given by the composition map Ft{x) = {QGC)t{x), which 
is a bounded linear operator from L2{I, BP) to L2{I, R"^). 
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3.2 Boundary Source: 



Next we consider the model described by the boundary value problem (9)-(ll). Here we 
consider a part dQo C dO, of the boundary dil where the source is active. Then for the 

source space wc take L2{I, U) where U is a closed linear subspace of H'^/'^{d^lo, R^) = {93 G 
H^/^{dn, : (p{^) = for e dno}. Aga in we let {ai} denote a family of disjoint subsets 
of the set d^o and model the input as 

n 

u{t, = E ^iit)MO,t e ^ e 9^0 (36) 

i=l 

where ipi £ U and supported on the set cxj and Xj G L2{I). The complete system model is 
then given by 

d'^E/dt^ - c^AE = 0,^ en,t>0, (37) 
E{0,O=0,E{0,O=0,^en, (38) 

n 

Eit,0\dn = u{t,0 = e dn,t> 0. (39) 

i=l 

For this model, we can use the representations (18)-(19) and (39) to construct the output 
equation. Define the map 

Kt{x) = u(^S{t)Co + J^S{t - r)A^Cx{r)dr^ ,tel. (40) 

Let r denote the projection map Fe = ei which projects e = (ei, 62) = {E, E) to the first 
component ei = E. Using these maps, again we can write the output equation in the same 
general form (35), 

dy = {gTKt){x)dt + dW 

= Ft{x)dt + dW,teI. (41) 

The existence of the map F is assured by the expressions (18)- (19) as presented in the 
semigroup approach (method A). 
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3.3 Noisy Source 



So far we have assumed that the source is noise free. In order to admit noisy source one 
must add some compatible additional terms to the evolution equations (3) and (15). For 
the distributed source, we replace the evolution equation (3) by the stochastic differential 
equation 

dz = Azdt + Budt + adW°, t > 0, (42) 

on the Hilbert space H, where the operator a is given by a 8 x 4 matrix of operators 
with the first four rows being all zero and the remaining 4x4 matrix is a diagonal matrix 
of operators {cj, i = 1,2,3,4}. The Brownian motion W° is given by the vector W° = 
co\{Wi, W2, W3 , W4 } of independent Brownian motions 

each taking values possibly from ^2(0, -R). In reference to the field equations (1) and (2), 
this means adding distributed white noise on the righthand side of each of the equations in 
the form aiW-'{t) = (Ti{-)W°{t, = 1, 2, 3, 4. This model allows one to deal with localized 
as well as distributed noise around the wave guide. Letting V denote any separable Hilbert 
space, for example a closed linear subspace of L2(r2, i?^), we may assume W° to be a F 
valued Brownian motion, independent of the Brownian motion W (receiver noise), with 
covariance operator denoted by Q° and a G JC{V,H) so that Q° = aQ°a* is a positive 
nuclear operator in H. A natural choice for the space V is L2{0,q,R'^) = U, same as 
the source space, and a = B. In any case this choice is determined primarily by physical 
requirements and mathematical simplicities. Since S{t),t & R, is a unitary group, it is easy 
to verify that S{t)Q°S*{t) is a positive nuclear operator in H for alH G -R given that 
is. Thus for each u G L2{I,U), equation (42) has a unique mild solution z, which belongs 
to C{I,H) with probability one, possessing bounded second moments. In this case the map 
Kt{x),t > 0, is given by 

Kt{x) = z{t) = S{t)zo + t S{t- r)Cx{r)dr + f S{t - r)adW". (43) 

Jo Jo 
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Considering the boundary value problems, for noisy boundary source, equation (15) is 
replaced by 

d( = {AC + A^u)dt + A^adW", t > 0. (44) 

For the boundary source we had chosen U C H^/'^{d^Q,B?). Thus it is necessary that 
(T G C{V, U) where V is any separable Hilbert space supporting the Brownian motion W°. 
For example, V = L2{dQo, R'^) or any closed linear subspace thereof. Again for the existence 
of mild solutions e = 11^ € C{I, H), it suffices if A^aQ°a*^*A* is a positive nuclear operator 
in 7^. In this case, equation (40) is replaced; and the process Kt{x),t > 0, is given by 

Ktix) = U (^S{t)Co + j^^ S{t - r)mCx{r)dr + J^* S{t - r)A^adW°^ ,tel. (45) 

Remark 3.1. In case the sensors (receiving antennas) are nonlinear, the operator Q is 
nonlinear and hence the composition map Ft, t e I, is also nonlinear. The results presented 

in this paper remain valid provided this nonlinearity is uniformly Lipschitz having at most 
linear growth. 

4 Channel Capacity 

In view of the preceding discussions, we notice that the input and output spaces are given by 
X = L2{I, i?") and Y = C{I, R^). Suppose these spaces are furnished with the (topological) 
Borel algebra turning them into measurable spaces {X,Bx) and {Y,By)- Let M{X) and 
M(Y) denote the space of Borel probability measures on {X, Bx) and (F, By) respectively. 
Considering the source space, let M2{X) C M{X) denote the class of probability measures 
having finite second moments, that is, 

/X G M2{X) if and only if / \x\\ ^x{dx) < oo. 

Jx 

Since normally the source power is limited, we consider a bounded subset of M2{X) given 

by 

Sr = {^ie M2{X) : / \xfx l^{dx) < rT} 
Jx 
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where r > is the power constraint and T is the length of the time interval / denoting the 
duration of the message source. For admissible source measures, we can choose any set M.ad 
which is a weakly compact and convex subset of the set Sr- For example, M.ad = wc£{rr) 
where is any convex subset of the set Sr satisfying 

lim sup / (x , Ci)^ u,(dx) = 

for any orthonormal basis {cj} of the Hilbert space X. Under this assumption, the set F^ is 
uniformly tight and hence conditionally weakly compact. Thus its weak closure is weakly 
compact. For more concrete examples of compact sets Mad see Remark 4.2 following 
theorem 4.1. 

Considering the output space {Y,By), let M{Y) denote the space of regular Borel 
probability measures on it. Let M{X x Y) denote the space of joint Borel probability 
measures on the product sigma algebra Bx x By- We have seen in section 3, that the 
output signal y is related to the input process x through the communication system (31)- 
(32) leading to (35) for the distributed source; and (37)-(39) leading to (41) for the Dirichlet 
source. In other words, for a given probability measure /x G C M2{X) on the input space, 
there is a unique measure u G M{Y) on the output space Y induced by the channel. Let 
7 G M{X X Y) denote the joint probability measure and x u the product measure with 
fi and V being the marginals of 7. The relative entropy of 7 with respect to the product 
measure ji x u, denoted by I{X^y), is called the mutual information which is a measure 
of the amount of information carried by the observable noisy output y about the input 
message (source) X. This is given by the following expression, 

/,^,y)./_<»,(;^^^^).(.xx.,), ,4.) 

where T(x,y) = i2dx)xv(dy) 'isnotes the Radon-Nikodym derivative of 7 with respect to the 
product measure. Clearly this requires that 7 be absolutely continuous with respect to the 
product probability measure ^ x v. Note that the output measure v is related to the input 
measure /x through the channel operator and it is given by 

v{D) = 7(X xD)= [ q{x,D)n{dx), ^ D e By (47) 
Jx 
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where 

q{x,D) = Pr{y e D\x} 

is the conditional probability of the output y being in D € By given that the input x G X. 
This is precisely the action of the channel on the input and, as we have seen in the preceding 
sections, it is determined by the dynamic models of the channel. A closed form expression 
for this will follow shortly. Substituting the expression (47) into the expression (46) we 
obtain 

I(X,y) . JW . j^Joy[^J^^y.,iyU^) (48) 

which is a functional of the measures q and (i. Since the channel dynamics is given, this is 
a functional of the source measure only as indicated above. We have seen in the preceding 
section that, for both the distributed and the boundary sources, the output equation has 
the general form given by a linear stochastic differential equation in i?"*, 

dy = Ft{x)dt + dW, 2/(0) = 0, (49) 

where F is the causal (nonanticipativc) map F : L2(/, i?") — > L2{I,R"^) as described 
earlier. This is a continuous linear map. Now it follows from equation (49) that for every 
given X E X, q{x,-) is & Gaussian measure on Y with mean trajectory given by 

F{x) = {C Fs{x)ds, tel} eY (50) 
Jo 

while the covariance operator Qi is given by 

{Qii,0= I {Ki{t,s)as),m)dsdt,^eY, 

with the kernel of the operator Qi being Ki(t,s) = (t A s)Im,{t,s) & I x I, and is 
the identity matrix of dimension m. Thus the Channel Kernel is given by the conditional 
Gaussian measure 

q{x,D) = Ng{F{x),Qi){D),x e X,De By. (51) 

Since F is a continuous linear map from X to Y, and Range{F{x)) C Range{Q\^'^) for all 
X G X, it is clear that, for every D G By, the map x — > q{x,D) is continuous from X to 
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the interval [0,1]. Using the expression (51) in the expression for the mutual information 
given by (48) we obtain the following equivalent expression, 

which is clearly dependent on the channel operator F. Denoting the convolution 

/ NG{F{x),Qi){D)fi{dx) = UG{D),DeBY 
Jx 

the reader can easily verify that 

NGiF{x),Q,)i-)<UG{-) 

for fj, almost all x e X. Thus the Radon-Nikodym derivative of Ng with respect to the 
measure z^g exists and hence J{n) given by (52) is well defined. Now our objective is to 
determine the capacity of the channel by maximizing the above functional over a set of 
admissible measures on the source space subject to power constraints. That is we must find 

C = sup{J{n),fieMad} (53) 
where the set A^a^, as defined before, is any weakly compact convex subset of the set 

Sr = {fie M2{X) : I \x\\ n{dx) < rT}. (54) 
Jx 

The first question that we must address is: does the supremum exist and, if it does, is it 
attained on the set Mad- Without much additional assumptions we can prove the following 
result. 

Theorem 4.1 Suppose Mad '^s a weakly compact and convex subset of the set S,- C M2{X). 
Then, there exists a unique /x° G Mad at which J attains its supremum. In other words 
capacity is attained. 

Proof. For the existence of supremum on Mad it suffices to prove that J is weakly upper 
semicontinuos and bounded away from +oo. For uniqueness we show that J is strictly 
concave. First we prove that the functional /i — > <^(a*) is concave and weakly upper 
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semicontinuous on M{X). For simplicity of notation we revert back to our original notation 
and set Ng{F{x),Qi){D) = q{x.D),D G By. For the first statement, we show that the 
functional 

is concave. Define the measure fi G M{Y) by the convolution 

/i(D)= / q(x,D)ii(dx),D e By. 
Jx 

Since, for each D G By, x — > q{x,D) is continuous and bounded, with values from [0, 1], 
this is well defined. Choose any v G M{Y) so that q{x, •) -< v(-) for all x £ X. Such a choice 
is assured since •) is a regular Borel probability measure on By induced by the output 
process {y} having continuous sample paths. Clearly fi ^ v also. Using this measure we 
can express (55) as the sum of two terms as follows. 

Then using Fubini's theorem and interchanging the order of integration in the second term 
we find that 

= Mf)-JM = Ji{n)-Hii)- (56) 



Consider the function 



We have already noted that for every D G By, x — > q{x,D) is continuous on X with 
values in [0, 1]. This implies continuity of rj. Indeed, rj is the uniform limit of the sequence 
of bounded continuous functions {rjm} given by 

rjM^f:iog('-^)qix,Y,) 



i=l 
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where {Yi} is a partition of Y by pairwise disjoint members Yi G By- Thus /x — ^ Jiifj) is 
hnear and bounded. By definition, the second term is the relative entropy of ft with respect 
to the measure v. For an arbitrary but fixed v G v — > hi^') is a strictly convex 

functional ([7],Lemma 1.4.3, p36) on the set {v G M{Y) : hii^) < oo}. This can be easily 
verified by use of Gibb's formula and strict convexity of the function r]{S,) = ■^log^,^ > 0. 
Thus I2 is strictly convex and hence —I2 is strictly concave. Combining these facts we 
conclude that /x — > J{^) is strictly concave proving the first part of the statement. Now 
we consider the question of continuity. It is clear from the expression (56) that J\ is 
bounded linear and hence continuous with respect to the weak topology. The functional J2, 
or equivalently I2, gives the relative entropy of fi with respect to the measure v. Again, it 
is well known that relative entropy is weakly lower semicontinuous ([7], Lemma 1.4.3, p36). 
Thus J2 is weakly lower semicontinuous and hence — J2 is weakly upper semicontinuous. 
Hence the functional J given by their sum is weakly upper semicontinuous proving the 
second part of the statement. Thus we conclude that ji — > J^jj) is strictly concave and 
weakly upper semicontinuous. Next we verify that sup{J(^),/i G -Mad} < co. Clearly 
it suffices to verify that sup{J(^),/i G Sr} < 00. An alternative expression for the mutual 
information, well known in the literature, ([6], Duncan,Theorem 2, p269), and ([ll],Lipster- 
Shirayev, Theorem 16.3, pl74) is given by 

where Ft{y) = £^|Ft(a;)|.?^| with denoting the smallest sigma algebra with respect to 
which the process {y{s), s < t} is measurable. This identity holds for all finite dimensional 
stochastic differential equations like (35) and (41) with F nonanticipative and Ft taking 
values from R^. Under the assumption, p| Jj \Ft{x)\'j^m.dt < cxd| = 1, the proof is identical. 
We give a brief outline. Let j{dx x dy) with its marginals, iJ,{dx) and v{dy), be the measures 
as introduced earlier and let (3{dy) denote the Wiener measure on Y. For the system 

dy = Ft{x)dt + dW,y{Q) =0, 

it follows from absolute continuity of 7 with respect to /x x /?, and v with respect to /3, 
and Girsanov measure substitution that the Radon-Nikodym derivative of 7 with respect 
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to /X X /3 is given by 

{jidx X dy)/ii{dx) X p{dy)){x,y) = {Ft{x),dy) - {1/2) j^\Ft{x)\%ndt^^ 

= exp|(l/2) ^ \Ft{x)\^dt + jm^), dW)Y 

Similarly, the RND of v with respect (3 is given by 

{y{dy)/f5{dy)){y) = p{y) = e^v[j {Ft{y),dy) - (1/2) \Ft{y)\''dt^ 

= my), Ft{x))dt- {1/2) J^\Ft{y)\'dt + J^<Ft{y),dW)y 

where Ft{y) = E{Ft{x)\J^}. Since by assumption p| Jj \Ft{x)\'j^mdt < oo| = 1 it is clear 
that P{p{y) = 0} = and hence (3{l/p{y) < oo} = 1. From these it follows that the RND 
of 7 with respect to the product measure p x u is given by 

(7(dx X dy)/p{dx) x u{dy)){x,y) = 

= exp|(l/2) \Ft{x) - Ft{y)\lmdt + j {Ft{x) - 

Using this expression and the definition of mutual information along with standard prop- 
erties of conditional expectations, we obtain 



I{X,y) = E^log{{j{dx X dy)/p{dx) X u{dy)){x,y)} | = 
= {l/2)Eymx)f-Ft{y)fdty 

This ends the outline. Conditional expectation is a contraction map and so I{X,y) > 0, 
as expected, and we have 

J{fi) = I{X,y) < {1/2)E \Ft{x)\lmdt < {1/2) J^y^ \Ft{x)\%mdt^fi{dx). 

Since F is a bounded linear operator from X to y there exists a finite positive number K, 
independent oitel, so that \Ft{x)\'jim < ^'^\x\'l2{10 t] ii")'* ^ ^- Hence it follows from the 
above inequality that for /x G M2{X), 



J(/x) < {K^T/2) [ \x\'^fi{dx) < 
Jx 



oo. 
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Thus on Sr C M2{X), we have 

sup J(/x) < < oo 

proving that the functional /i — > J{l^) is bounded away from +00 on 5*^. Since J is weakly 
upper semicontinuous and bounded away from +00 and, by our choice, the admissible set 
■Mad C Sr is weakly compact, J attains its supremum on Mad- This proves the existence of 
a /x° G Mad ^'t which the capacity C is attained, that is, C = J{lJ.°)- The uniqueness follows 
from strict concavity of jjl — > J(/x) and convexity of Mad- This completes the proof. • 

Remark 4.2 (Some Examples of Compact sets): Here we present some simple ex- 
amples of weakly compact subsets Mad of the set Sr- (El): Let {fik} C S'r be a family 
of distinct measures, in the sense that jik ^ /x^ on Bx for k ^ m, and A a subset of ii 
satisfying the following properties: 

00 

(1) : ajfc > 0, ^ Ofc = 1 for a G A, 

k=l 

(2) : lim = 0, uniformly in a G A. 

k>N 

Define the set Ma = {// G M{X) : jj, = J2k>i ^kl-^k: ct G A}. The reader can easily verify that 
Ma C Sr and it is uniformly tight and so relatively weakly compact. Since A is compact, Ma 
is closed and hence it is weakly compact. (E2) : A variant of this example for which the same 
conclusion holds is as follows. Let Xq be a countable dense subset of the closed ball Ba{X) of 
X of radius a < rT. Then the set Ma{Xq) = {/x G M{X) : /x = J2k>i (^k^xk ,Xk e Xo,a e A} 
is a weakly compact subset of Sr- (E3): If T is any uniformly tight subset of Sr, then the set 
Mad = wci{T) given by the weak closure of T is weakly compact. (E4): Let D be a compact 
subset of the Hilbert space X and M{D) = {fi G M{X) : ijl{X \ D) = 0}. Clearly M{D) is 
a weakly compact subset of M{X). We may choose D such that Sr n M{D) / and then 
take Mad = SrC) M{D). Since X = L2{I, i?"), it follows from Sobolev embedding theorems 
that for any finite interval / and any p G [2, 00) the embedding W^'P{I, RP) ^ L2{I,R^) 
is continuous and compact. Thus a good example of a compact set Z) of X is any closed 
bounded subset D of W^'P{I,R''). 
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5 Maximizing Source Measure 



In this section we wish to present necessary conditions that a (maximizing) measure, subject 
to energy constraints and determining the channel capacity, must satisfy. Such conditions 
are called necessary conditions of optimality. In fact in the following result we have both 
necessary and sufficient conditions of optimality. 

Theorem 5.1 In order for /x° G Mad C M2{X) to be the optimum source measure, it is 
necessary and sufficient that the following inequality holds 

for all G Mad- 
Proof Let G Mad denote the optimizer, that is, 

J(/x°) = C = sup{J(/x),/x G Mad}- 

Then for any // G Mad, define fi^ = fi" + e{ii — ji") for e G (0, 1). Since Mad is convex, 
it is clear that jJL^ G Mad- Clearly jjF being the maximal element, we have J{lJi°) > Jif^^)- 
Let DJ{ii) denote the Gateaux gradient of J at ;u whenever it exists. Then computing the 
limit, 

elO I e ) 

and noting that for optimality, 

< DJ{ii"),ii- > < 0, (58) 

it is easy to verify that 

(59) 
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V /X G M-ad- The inequality (57) now easily follows from (59). For the sufficiency, recall that 
H J{ix) is concave. Hence 

< DJ{n),u- fi> > J{u)-J{n) yfi,ueMad- (60) 

Taking /j, = /j," it follows from this inequality and (57), or equivalently (58), that 

> <DJ{n°),u- 11° > > J{u)-J{fi") V veMad- (61) 

and hence J{fi°) > J{i') V i/ G M.ad- This proves the sufficiency of condition (57) thereby 
completing the proof. • 

Remark 5.2 So far in sections 4 and 5, we have assumed the source of electro-magnetic 
fields to be noise free. As seen in section 3.3, to include noisy source, wc must consider the 
evolution equations (42) and (44) which are the stochastic versions of Maxwell (field) equa- 
tions. With the source noise included, the results of Theorem 4.1 and Theorem 5.1 remain 
valid with the replacement of the Channel Kernel Na{F{x), Qi){-) by Na{F{x),Qi+Q2){-) 
where Q2 is the covariance operator associated with the source noise. For the distributed 
source model (42), the covariance operator Q2 is given by 

{Q2^, = f < K2{t, T)at), C(r) > dtdr (62) 
Jlxl 

where the kernel K2 is given in terms of the system parameters as follows: 

ftAr 

K2{t,T)= drR{t,r)R*{T,r), (63) 
Jo 

with R{t, r),0 <r <t <T, given by 

R{t, r) = gV f S{e- r)ade = f gVS{e - r)ade. (64) 

Jr J r 

The last identity follows from the fact that QV is a bounded linear operator from the state 
space H to R^ and hence closed and so commutes with the integral operation. Similar 
expressions can be derived for the boundary source. These conclusions are based on the 
properties of stochastic convolutions, see ([5], Theorem 5.2, pll9) which use stochastic Fu- 
bini's theorem ([5], Theorem 4.18, pl09). 
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Remark 5.3 (Numerical Algorithm for Computation): Based on the necessary (and 
sufficient) conditions of optimality as presented above, we can develop a gradient based al- 
gorithm for numerical computation. In particular, for simple sources like those of examples, 
(El) and (E2), the functional J(//) on Mad can be redefined as being a functional on A 
and, with slight abuse of notation, we may denote it by J (a). Our problem is to find an 
a° G A at which J attains its supremum. For the source (El), it follows from the necessary 
conditions that in order for a" G A to be the optimal distribution of weights assigned to 
the family of measures {m'^}, it is necessary and sufficient that we have. 



where 



Y,{ak-al)f L^o{x) ii\dx)>Q, V a e A, (65) 



Lao{x) = j^\og(^q{x,dy)/Y,<^k ^^{dy)^<i{x,dy), v^{dy) = j^q{x,dy)iJ'{dx). 



k>l 

For the source (E2), this further reduces to 



^(afc - al)Lao{xk) > 0, V a G A, 

k>l 

where 

La''ixk) = J^log(q{xk,dy)/ q{xk,dy)jq{xk,dy). 

For these simple sources, one can write a gradient based numerical algorithm for finding 
the optimum source. Considering the first source, the gradient of J at the n-th iteration is 
given by 

AJ(a'*) = {AfeJ(a") = J Lan{x)i/{dx),k = 1,2,---}. 

Choose 

a^+i = al + enAkJ{a''), k e N, 

with £n> sufficiently small so that a""*"^ G A. Using this a^'^'^, we compute the objective 
functional giving 

oo 

J(a"+i) = J(a") + e 5^(Afe J(a"))2 + 0{e). 

k=l 
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For £n > sufficiently small the series converges guaranteeing improvement of the objective 
functional at each step. Similar conclusion holds for the source (E2) with the gradient 
vector given by 

A J(a") = {Afe J(«") = La^ixk), k = l,2,-- •}. 

6 Conclusion 

We have presented a complete dynamic model for MIMO channels (wave guides, cavities) 
based on deterministic as well as stochastic Maxwell's equations. To the best of our knowl- 
edge, this formulation has not been considered in the literature. Both distributed and 
boundary sources have been considered. Proof of existence of maximizing source measure 
subject to power constraints has been presented. Optimality conditions have been devel- 
oped which can be used for numerical computations as indicated in remark 5.3 for simple 
source spaces. The model used for sensors in the above formulation can be extended to cover 
linear (sensor) dynamics without any difficulty. This requires inclusion of the associated 
transition operator for the construction of the output map Ft{x). It would be interesting to 
study similar problems with nonlinear sensor dynamics. 
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